Overview

Dataset Statistics

Number of Variables 23
Number of Rows 36936
Missing Cells 152380
Missing Cells (%) 17.9%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 12.7 MB
Average Row Size in Memory 360.7 B
Variable Types
  • Categorical: 8
  • Numerical: 14
  • DateTime: 1

Dataset Insights

ID is uniformly distributed Uniform
CapacityFactor has 36936 (100.0%) missing values Missing
ArrayRatio has 36936 (100.0%) missing values Missing
Generation(kWd) has 36936 (100.0%) missing values Missing
HourlyCloudAmount has 32428 (87.8%) missing values Missing
HourlyPrecipitation has 9144 (24.76%) missing values Missing
Angle is skewed Skewed
Capacity is skewed Skewed
Irradiance(kWd/m2) is skewed Skewed
ClearSkyIrradiance(kWh/m2) is skewed Skewed
Irradiance(kWh/m2) is skewed Skewed
HourlyCloudAmount is skewed Skewed
HourlyPrecipitation is skewed Skewed
Date has a high cardinality: 112 distinct values High Cardinality
Set has constant value "test" Constant
Set has constant length 4 Constant Length
Date has constant length 10 Constant Length
CapacityFactor has all distinct values Unique
ArrayRatio has all distinct values Unique
Generation(kWd) has all distinct values Unique
Angle has 13200 (35.74%) negatives Negatives
Angle has 7968 (21.57%) zeros Zeros
ClearSkyIrradiance(kWh/m2) has 20007 (54.17%) zeros Zeros
Irradiance(kWh/m2) has 19954 (54.02%) zeros Zeros
HourlyPrecipitation has 23862 (64.6%) zeros Zeros
  • 1
  • 2
  • 3

Variables


Set

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2548584

Length

Mean 4
Standard Deviation 0
Median 4
Minimum 4
Maximum 4

Sample

1st row test
2nd row test
3rd row test
4th row test
5th row test

Letter

Count 147744
Lowercase Letter 147744
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Set has words of constant length

ID

numerical

Approximate Distinct Count 1539
Approximate Unique (%) 4.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 770
Minimum 1
Maximum 1539
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • ID is uniformly distributed

Quantile Statistics

Minimum 1
5-th Percentile 77
Q1 385
Median 770
Q3 1155
95-th Percentile 1463
Maximum 1539
Range 1538
IQR 770

Descriptive Statistics

Mean 770
Standard Deviation 444.277
Variance 197382.0106
Sum 2.8441e+07
Skewness 0
Kurtosis -1.2
Coefficient of Variation 0.577
  • ID is not normally distributed (p-value 0.001707976880722162)

Date

categorical

Approximate Distinct Count 112
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Memory Size 2770200

Length

Mean 10
Standard Deviation 0
Median 10
Minimum 10
Maximum 10

Sample

1st row 2021-10-29
2nd row 2021-10-29
3rd row 2021-10-29
4th row 2021-10-29
5th row 2021-10-29

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 73872
Decimal Number 295488
  • Date has words of constant length

Lat

categorical

Approximate Distinct Count 9
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2590656

Length

Mean 5.1391
Standard Deviation 0.346
Median 5
Minimum 5
Maximum 6

Sample

1st row 25.11
2nd row 25.11
3rd row 25.11
4th row 25.11
5th row 25.11

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 152880

Lon

categorical

Approximate Distinct Count 8
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2619792
  • The largest value (120.52) is over 1.97 times larger than the second largest value (121.26)

Length

Mean 5.9279
Standard Deviation 0.2587
Median 6
Minimum 5
Maximum 6

Sample

1st row 121.26
2nd row 121.26
3rd row 121.26
4th row 121.26
5th row 121.26

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 182016
  • The largest value (12052) is over 1.97 times larger than the second largest value (12126)

Angle

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean -17.3875
Minimum -160
Maximum 22
Zeros 7968
Zeros (%) 21.6%
Negatives 13200
Negatives (%) 35.7%
  • Angle is skewed left (γ1 = -2.1267)

Quantile Statistics

Minimum -160
5-th Percentile -160
Q1 -2.62
Median 0
Q3 4.63
95-th Percentile 22
Maximum 22
Range 182
IQR 7.25

Descriptive Statistics

Mean -17.3875
Standard Deviation 47.832
Variance 2287.8999
Sum -642223.92
Skewness -2.1267
Kurtosis 3.277
Coefficient of Variation -2.7509
  • Angle is not normally distributed (p-value 1.6081017169036972e-17)
  • Angle has 10560 outliers

Module

categorical

Approximate Distinct Count 4
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2981328
  • The largest value (AUO PM060MW3 320W) is over 3.01 times larger than the second largest value (MM60-6RT-300)

Length

Mean 15.716
Standard Deviation 2.082
Median 17
Minimum 12
Maximum 17

Sample

1st row MM60-6RT-300
2nd row MM60-6RT-300
3rd row MM60-6RT-300
4th row MM60-6RT-300
5th row MM60-6RT-300

Letter

Count 255936
Lowercase Letter 0
Space Separator 52752
Uppercase Letter 255936
Dash Punctuation 23808
Decimal Number 247992
  • The top 2 categories (AUO PM060MW3 320W, MM60-6RT-300) take over 50.0%

Capacity

numerical

Approximate Distinct Count 14
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 335.6543
Minimum 99.2
Maximum 499.8
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Capacity is skewed left (γ1 = -0.3129)

Quantile Statistics

Minimum 99.2
5-th Percentile 99.2
Q1 267.52
Median 314.88
Q3 492.8
95-th Percentile 499.8
Maximum 499.8
Range 400.6
IQR 225.28

Descriptive Statistics

Mean 335.6543
Standard Deviation 132.4449
Variance 17541.6577
Sum 1.2398e+07
Skewness -0.3129
Kurtosis -0.8659
Coefficient of Variation 0.3946
  • Capacity is not normally distributed (p-value 5.606280189524647e-14)

CapacityFactor

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2511648

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 110808
Lowercase Letter 110808
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • CapacityFactor has words of constant length

ArrayRatio

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2511648

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 110808
Lowercase Letter 110808
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • ArrayRatio has words of constant length

Generation(kWd)

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2511648

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row nan
2nd row nan
3rd row nan
4th row nan
5th row nan

Letter

Count 110808
Lowercase Letter 110808
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 0
  • Generation(kWd) has words of constant length

Irradiance(kWd/m2)

numerical

Approximate Distinct Count 294
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 3.3784
Minimum 0.2611
Maximum 5.6111
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWd/m2) is skewed left (γ1 = -0.5668)

Quantile Statistics

Minimum 0.2611
5-th Percentile 0.7556
Q1 2.2778
Median 3.8667
Q3 4.525
95-th Percentile 5.1194
Maximum 5.6111
Range 5.35
IQR 2.2472

Descriptive Statistics

Mean 3.3784
Standard Deviation 1.3986
Variance 1.9561
Sum 124785.7333
Skewness -0.5668
Kurtosis -0.9353
Coefficient of Variation 0.414
  • Irradiance(kWd/m2) is not normally distributed (p-value 3.5009769164837214e-08)

Datetime

datetime

Distinct Count 2691.5206
Approximate Unique (%) 7.3%
Missing 0
Missing (%) 0.0%
Memory Size 295616
Minimum 2021-10-29 00:00:00
Maximum 2022-02-17 23:00:00

ClearSkyIrradiance(kWh/m2)

numerical

Approximate Distinct Count 13300
Approximate Unique (%) 36.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 0.2005
Minimum 0
Maximum 0.8417
Zeros 20007
Zeros (%) 54.2%
Negatives 0
Negatives (%) 0.0%
  • ClearSkyIrradiance(kWh/m2) is skewed right (γ1 = 0.9435)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.4296
95-th Percentile 0.7265
Maximum 0.8417
Range 0.8417
IQR 0.4296

Descriptive Statistics

Mean 0.2005
Standard Deviation 0.2784
Variance 0.07748
Sum 7404.9775
Skewness 0.9435
Kurtosis -0.7834
Coefficient of Variation 1.3884
  • ClearSkyIrradiance(kWh/m2) is not normally distributed (p-value 5.430420510606428e-25)

Irradiance(kWh/m2)

numerical

Approximate Distinct Count 300
Approximate Unique (%) 0.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 0.1244
Minimum 0
Maximum 0.8667
Zeros 19954
Zeros (%) 54.0%
Negatives 0
Negatives (%) 0.0%
  • Irradiance(kWh/m2) is skewed right (γ1 = 1.6402)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0.1833
95-th Percentile 0.6222
Maximum 0.8667
Range 0.8667
IQR 0.1833

Descriptive Statistics

Mean 0.1244
Standard Deviation 0.2034
Variance 0.04139
Sum 4596.3833
Skewness 1.6402
Kurtosis 1.5345
Coefficient of Variation 1.6348
  • Irradiance(kWh/m2) is not normally distributed (p-value 5.375868630960878e-25)
  • Irradiance(kWh/m2) has 4019 outliers

HourlyTemperature

numerical

Approximate Distinct Count 219
Approximate Unique (%) 0.6%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 18.5635
Minimum 8.9
Maximum 31.6
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HourlyTemperature is skewed right (γ1 = 0.5509)

Quantile Statistics

Minimum 8.9
5-th Percentile 13.8
Q1 16.1
Median 18
Q3 20.8
95-th Percentile 24.8
Maximum 31.6
Range 22.7
IQR 4.7

Descriptive Statistics

Mean 18.5635
Standard Deviation 3.4182
Variance 11.6844
Sum 685660.3
Skewness 0.5509
Kurtosis -0.03403
Coefficient of Variation 0.1841
  • HourlyTemperature has 284 outliers

HourlyHumidity

numerical

Approximate Distinct Count 70
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 78.0745
Minimum 31
Maximum 100
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • HourlyHumidity is skewed left (γ1 = -0.4395)

Quantile Statistics

Minimum 31
5-th Percentile 57
Q1 70
Median 79
Q3 87
95-th Percentile 96
Maximum 100
Range 69
IQR 17

Descriptive Statistics

Mean 78.0745
Standard Deviation 12.0696
Variance 145.6758
Sum 2.8838e+06
Skewness -0.4395
Kurtosis -0.1952
Coefficient of Variation 0.1546
  • HourlyHumidity has 210 outliers

HourlyWindSpeed

numerical

Approximate Distinct Count 136
Approximate Unique (%) 0.4%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 3.9506
Minimum 0
Maximum 14.7
Zeros 48
Zeros (%) 0.1%
Negatives 0
Negatives (%) 0.0%
  • HourlyWindSpeed is skewed right (γ1 = 0.7183)

Quantile Statistics

Minimum 0
5-th Percentile 0.5
Q1 1.6
Median 3.2
Q3 6
95-th Percentile 9.4
Maximum 14.7
Range 14.7
IQR 4.4

Descriptive Statistics

Mean 3.9506
Standard Deviation 2.882
Variance 8.3059
Sum 145919.9
Skewness 0.7183
Kurtosis -0.4178
Coefficient of Variation 0.7295
  • HourlyWindSpeed has 68 outliers

HourlyCloudAmount

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.2%
Missing 32428
Missing (%) 87.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 72128
Mean 5.8864
Minimum 0
Maximum 10
Zeros 1031
Zeros (%) 2.8%
Negatives 0
Negatives (%) 0.0%
  • HourlyCloudAmount is skewed left (γ1 = -0.4288)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 1
Median 8
Q3 10
95-th Percentile 10
Maximum 10
Range 10
IQR 9

Descriptive Statistics

Mean 5.8864
Standard Deviation 4.1007
Variance 16.8156
Sum 26536
Skewness -0.4288
Kurtosis -1.5431
Coefficient of Variation 0.6966
  • HourlyCloudAmount is not normally distributed (p-value 4.355710315951026e-15)

HourlyPrecipitation

numerical

Approximate Distinct Count 11
Approximate Unique (%) 0.0%
Missing 9144
Missing (%) 24.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 444672
Mean 0.0641
Minimum 0
Maximum 1
Zeros 23862
Zeros (%) 64.6%
Negatives 0
Negatives (%) 0.0%
  • HourlyPrecipitation is skewed right (γ1 = 3.4907)

Quantile Statistics

Minimum 0
5-th Percentile 0
Q1 0
Median 0
Q3 0
95-th Percentile 0.5
Maximum 1
Range 1
IQR 0

Descriptive Statistics

Mean 0.0641
Standard Deviation 0.1987
Variance 0.03946
Sum 1781.4
Skewness 3.4907
Kurtosis 11.5988
Coefficient of Variation 3.0992
  • HourlyPrecipitation is not normally distributed (p-value 4.9883165651095715e-25)
  • HourlyPrecipitation has 3930 outliers

Hour

numerical

Approximate Distinct Count 24
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 11.5
Minimum 0
Maximum 23
Zeros 1539
Zeros (%) 4.2%
Negatives 0
Negatives (%) 0.0%

Quantile Statistics

Minimum 0
5-th Percentile 1
Q1 5.75
Median 11.5
Q3 17.25
95-th Percentile 22
Maximum 23
Range 23
IQR 11.5

Descriptive Statistics

Mean 11.5
Standard Deviation 6.9223
Variance 47.918
Sum 424764
Skewness 0
Kurtosis -1.2042
Coefficient of Variation 0.6019
  • Hour is not normally distributed (p-value 8.530609293617627e-198)

DayOfYear

numerical

Approximate Distinct Count 112
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 0.556
Minimum 0.002732
Maximum 0.9973
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • DayOfYear is skewed left (γ1 = -0.3099)

Quantile Statistics

Minimum 0.002732
5-th Percentile 0.01639
Q1 0.0765
Median 0.847
Q3 0.9235
95-th Percentile 0.9836
Maximum 0.9973
Range 0.9945
IQR 0.847

Descriptive Statistics

Mean 0.556
Standard Deviation 0.42
Variance 0.1764
Sum 20537.0492
Skewness -0.3099
Kurtosis -1.8566
Coefficient of Variation 0.7553
  • DayOfYear is not normally distributed (p-value 0.00043625695608008575)

DayOfYearTransformed

numerical

Approximate Distinct Count 102
Approximate Unique (%) 0.3%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 590976
Mean 0.1535
Minimum 0
Maximum 0.3169
Zeros 336
Zeros (%) 0.9%
Negatives 0
Negatives (%) 0.0%
  • DayOfYearTransformed is skewed right (γ1 = 0.0064)

Quantile Statistics

Minimum 0
5-th Percentile 0.01639
Q1 0.0765
Median 0.153
Q3 0.2295
95-th Percentile 0.2896
Maximum 0.3169
Range 0.3169
IQR 0.153

Descriptive Statistics

Mean 0.1535
Standard Deviation 0.08858
Variance 0.007847
Sum 5668.7213
Skewness 0.006413
Kurtosis -1.1685
Coefficient of Variation 0.5772
  • DayOfYearTransformed is not normally distributed (p-value 0.00032387438028069537)

Interactions

Correlations

Missing Values